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SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: BILLING-MEDEL, PATRICIA A. 
COHEN, MAURICE 
COLPITTS, TRACE Y L. 
FRIEDMAN, PAULA N. 
HA YD EN, MARK 
KLASS, MICHAEL R. 
ROBERTS -RAPP, LISA 
RUSSELL, JOHN C. 
STROUPE, STEPHEN D. 



La Hi) TITLE OF THE INVENTION: REAGENTS AND METHODS FOR THE 

P3 USEFUL FOR DETECTING DISEASES OF THE GASTROINTESTINAL 

IS TRACT 

Ld 

fjj (iii) NUMBER OF SEQUENCES : 51 

M (iv) CORRESPONDENCE ADDRESS: 

if! (A) ADDRESSEE: Abbott Laboratories 

J, (B) STREET: 100 Abbott Park Road 

~~~~ J - (C) CITY: Abbott Park 

= (D) STATE: IL 

M (E) COUNTRY: USA 

f L j (F) ZIP: 60064-3500 

^ (v) COMPUTER READABLE FORM: 

yj (A) MEDIUM TYPE: Diskette 

O (B) COMPUTER: IBM Compatible 

U, (C) OPERATING SYSTEM: DOS 

(D) SOFTWARE : FastSEQ for Windows Version 2 . 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/828,856 

(B) FILING DATE: 31-MAR-1997 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Becker, Cheryl L. 

(B) REGISTRATION NUMBER: 35,441. 

(C) REFERENCE/DOCKET NUMBER: 6068. US. PI 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 847/935-1729 

' (B) TELEFAX: 847/938-2623 
(C) TELEX: 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GCCAGGAATA ACTAGAGAGG AACAATGGGG TTATTCAGAG GTTTTGTTTT CCTCTTAGTT 60 

CTGTGCCTGC TGCACCAGTC AAATACTTCC TTCATTAAGC TGAATAATAA TGGCTTTGAA 120 

GATATTGTCA TTGTTATAGA TCCTAGTGTG CCAGAAGATG AAAAAATAAT TGAACAAATA 180 

GAGGATATGG TGACTACAGC TTCTACGTAC CTGTTTGAAG CCACAGAAAA AAGATTTTTT 240 

T 241 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
M (A) LENGTH: 219 base pairs 

Q <B) TYPE: nucleic acid 

f=i (C) STRANDEDNESS : single 

pi (D) TOPOLOGY: linear 

U1 

jM <xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

m 

■Z i CTAGAGAGGA ACAATGGGGT TATTCAGAGG TTTTGTTTTC CTCTTAGTTC TGTGCCTGCT 6 0 

- GCACCAGTCA AATACTTCCT TCATTAAGCT GAATAATAAT GGCTTTGAAG ATATTGTCAT 120 

* TGTTATAGAT CCTAGTGTGC CAGAAGATGA AAAAATAATT GAACAAATAG AGGATATGGT 180 

M GACTACAGCT TCTACGTACC TGTTTGAAGC CACAGAAAA 219 

ru 

-~ (2) INFORMATION FOR SEQ ID NO : 3 : 

p s 

yQ (i) SEQUENCE CHARACTERISTICS: 

□ (A) LENGTH: 231 base pairs 

M, (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( ix) FEATURE : 

(A) NAME /KEY : base_polymorphism 
<B) LOCATION: 3 

(D) OTHER INFORMATION: /note= M, N' represents an A or G or 
T or C polymorphism at this position" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TTNTGTAACG AAAAAACCCA TAATCAAGAA GCTCCAAGCC TACAAAACAT AAAGTGCAAT 60 

TTTAGAAGTA CATGGGAGGT GATTAGCAAT TCTGAGGATT TTAAAAACAC CATACCCATG 120 

GTGACAC CAC CTCCTCCACC TGTCTTCTCA TTGCTGAAGA TCAGT CAAAG AATTGTGTGC 180 

TTAGTTCTTG ATAAGTCTGG AAGCATGGGG GGTAAGGACC GCCTAAATCG A 231 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
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rrrrrrm AA GGACCGCCTA AATCGAATGA ATCAAGCAGC AAAACATTTC CTGCTGCAGA 60 

Sot?SKaa" ?S3atcctgg GTGGGGATGG TTCACTTTGA TAGTACTGCC ACTATTGTAA 120 

SSE™S JcAAATAAAA AGCAGTGATG AAAGAAACAC ACTCATGGCA GGATTACCTA 180 

CaScCTCT GGGAGGAACT TCCATCTGCT CTGGAATTAA ATATGCATTT CAGGTGA 237 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

rTTrrATCTG CTCTGGAATT AAATATGCAT TTCAGGTGAT TGGAGAGCTA CATTCCCAAC 
SStgSSc Saactactg CTGCTGACTG ATGGGGAGGA TAACACTGCA AGTTCTTGTA 

™a?gaagt gaaacaaagt ggggccattg ttcattttat tgctttggga agagctgctg 
H atgaagcagt aatagagatg agcaagataa caggag 

y (2) INFORMATION FOR SEQ ID NO: 6: 

W (i) SEQUENCE CHARACTERISTICS: 

III (A) LENGTH: 201 base pairs 

|Ia (B) TYPE: nucleic acid 

' m (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



120 
180 
216 



Sj 



o 



(ix) FEATURE: 

(A) NAME/ KEY : base_polymorphism 

(B) LOCATION: 24 

D OTHER INFORMATION: /note- "'N' represents an A or G or 
T or C polymorphism at this position' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AATTGATAGT ACAGTGGGAA AGGNCACGTT CTTTCTCATC ACATGGAACA GTCTGCCTCC 60 

?5ota???2? ctctgggatc CCAGTGGAAC aataatggaa aatttcacag tggatgcaac 120 

5tc?aAaItG GCCTATCTCA GTATTCCAGG AACTGCAAAG GTGGGCACTT GGGCATACAA 180 
TCTTCAAGCC AAAGCGAACC C 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GCAAATTCTT CTGTGCCTCC AATCACAGTG AATGCTAAAA TGAATAAGGA CGTAAACAGT 
SS?J£Sgcc CAATGATTGT TTACGCAGAA ATTCTACAAG GATATGTACC TGTTCTTGGA 

SSSSctga ctgctttcat tgaatcacag aatggacata cagaagtttt ggaacttttg 
ga?a!?gg?g ?Iggcgctga ttctttcaag aatgatggag tctactccag gtattttaca 

G 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 



60 
120 
180 
240 
241 



m 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

™2^™S C TGATTCTTTC AAGAATGATG GAGTCTACTC CAGGTATTTT ACAGCATATA SO 
CAGAAAATGG CAGATATAGC TTAAAAGTTC GGGCTCATGG AGGAGCAAAC ACTGCCAGGC 120 
T AAA i£ TACG GCCTCCACTG AATAGAGCCG CGTACATACC AGGCTGGGTA GTGAACGGGG 180 
AAATTGAAGC AAACCCGCCA AGACCTGAAA TTGATGAGGA TACTCAGACC ACCTTGGAGG 24 0 

242 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

!H < xi > SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

5 ^CGCCAAGA CCTGAAATTG ATGAGGATAC TCAGACCACC TTGGAGGATT TCAGCCGAAC 60 

id AGCATCCGGA GGTGCATTTG TGGTATCACA AGTCCCAAGC CTTCCCTTGC CTGACCAATA 120 

^ SSSSSSSI n^^ ACAG ACCTTGA TGC CACAGTTCAT GAGGATAAGA TTATTCTTAC 180 

M ATGGACAGCA CCAGGAGATA ATTTTGATGT TGGAAAAGTT CAACGTTATA TCA 233 

fan 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS : 



U> (A) LENGTH: 313 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



{ ix) FEATURE : 



S (A) NAME /KEY : base_polymorphism 

= a (B) LOCATION: 22 

(D) OTHER INFORMATION: /note- »«N' represents an A or G or 
T or C polymorphism at this position" 

( ix) FEATURE : 

(A) NAME/KEY: base_polymorphism 

(B) LOCATION: 44 

(D) OTHER INFORMATION: /note- « 4 N' represents an A or G or 
T or C polymorphism at this position" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

^AATACCCA CCAAGTCAAA TNACAGACCT TGATGCCACA GTTNATGAGG ATAAGATTAT 60 

JSSJSKS AGAGCACCAG GAGATAATTT TGATGTTGGA AAAGTTCAAC GTTATATCAT 120 

^^l^l GCAAGTATTC TTGATCTAAG AGACAGTTTT GATGATGCTC TTCAAGTAAA 180 

TACTACTGAT CTGTCACCAA AGGAGGCCAA CTCCAAGGAA AGCTTTGCAT TTAAACCAGA 74n 

GAAGAAAATG CAACCCACAT ATTTATTGCC SjgSSSJ SSSSSS 300 



(2) 



INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AAGTATTCTT GATCTAAGAG ACAGTTTTGA TGATGCTCTT CAAGTAAATA CTACTGATCT 60 

GTCACCAAAG GAGGCCAACT CCAAGGAAAG CTTTGCATTT AAACCAGAAA ATATCTCAGA 120 

AGAAAATGCA ACCCACATAT TTATTGCCAT TAAAAGTATA GATAAAAGCA ATTTGACATC 180 

AAAAGTATCC AACATTGCAC AAGTAACTTT GTTTATCCCT CAAGCAAATC CTGATGACAT 240 

TG 242 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: base_j?olymorphism 

(B) LOCATION: 2 

U> (D) OTHER INFORMATION: /note= "'N' represents an A or G or 

L, T or C polymorphism at this position" 

5 (ix) FEATURE: 

RJ (A) NAME /KEY : basejpolymorphism 

Lfj (B) LOCATION: 4 

(D) OTHER INFORMATION: /note= n, N' represents an A or G or 
* T or C polymorphism at this position" 

yl 

SJ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

L ANANAATGCA ACCCACATAT TTATTGCCAT TAAAAGTATA GATAAAAGCA ATTTGACATC 60 

L. AAAAGTATCC AACATTGCAC AAGTAACTTT GTTTATCCCT CAAGCAAATC CTGATGACAT 120 

\~ TGATCCTACT CCTACTCCTA CTCCTACTCC TGATAAAAGT CATAATTCTG GAGTTAATAT 180 

^ TTCTACGCTG GTATTGTCTG TGATTGGG 208 

□ (2) INFORMATION FOR SEQ ID NO: 13: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: aAngle 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CTCCTACTCC TACTCCTGAT AAAAGTCATA ATTCTGGAGT TAATATTTCT ACGCTGGTAT 60 

TGTCTGTGAT TGGGTCTGTT GTAATTGTTA ACTTTATTTT AAGTACCACC ATTTGAACCT 12 0 

TAACGAAGAA AAAAATCTTC AAGTAGACCT AGAAGAGAGT TTTAAAAAAC AAAACAATGT 180 
AAGTAAAGGA TATTTCTGAA T 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
• (D) TOPOLOGY: linear 

( ix) FEATURE : 

(A) NAME/KEY: basejpolymorphism 

(B) LOCATION: 111 

(D) OTHER INFORMATION: /note= W, N' represents an A or G or 
T or C polymorphism at this position" 



(ix) FEATURE: 

(A) NAME /KEY base_polymorphism 

(B) LOCATION: 244 

(D) OTHER INFORMATION: /note= " 'N' represents an A or G or 
T or C polymorphism at this position" 

( ix) FEATURE : 

(A) NAME /KEY : base_polymorphism 

(B) LOCATION: 284 

(D) OTHER INFORMATION: /note= n, N' represents an A or G or 
T or C polymorphism at this position" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TCTGTTGTAA TTGTTAACTT TATTTTAAGT ACCACCATTT GAACCTTAAC GAAGAAAAAA 60 

ATCTTCAAGT AGACCTAGAA GAGAGTTTTA AAAAACAAAA CAATGTAAGT NAAGGATATT 120 

TCTGAATCTT AAAATTCATC CCATGTGTGA TCATAAACTC ATAAAAATAA TTTTAAGATG 180 

TCGGAAAAGG ATACTTTGAT TAAATAAAAA CACTCATGGA TATGTAAAAA CTGTCAAGAT 240 

TAANATTTAA TAGTTTCATT TATTTGTTAT TTTATTTGTA AGANATAGTG ATGAACAAAG 300 

A 301 

Q 

q (2) INFORMATION FOR SEQ ID NO: 15: 

0= (i) SEQUENCE CHARACTERISTICS: 

Ul (A) LENGTH: 22 9 base pairs 

M (B) TYPE: nucleic acid 

m (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

M (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

jj: GAAAAAAATC TTCAAGTAGA CCTAGAAGAG AGTTTTAAAA AACAAAACAA TGTAAGTAAA 60 

= GGATATTTCT GAATCTTAAA ATTCATCCCA TGTGTGATCA TAAACTCATA AAAATAATTT 120 

m TAAGATGTCG GAAAAGGATA CTTTGATTAA ATAAAAACAC TCATGGATAT GTAAAAACTG 180 

Q TCAAGATTAA AATTTAATAG TTTCATTTAT TTGTTATTTT ATTTGTAAG 229 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3043 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTAGAGAGGA ACAATGGGGT TATTCAGAGG TTTTGTTTTC CTCTTAGTTC TGTGCCTGCT 60 

GCACCAGTCA AATACTTCCT TCATTAAGCT GAATAATAAT GGCTTTGAAG ATATTGTCAT 120 

TGTTATAGAT CCTAGTGTGC CAGAAGATGA AAAAATAATT GAACAAATAG AGGATATGGT 180 

GACTACAGCT TCTACGTACC TGTTTGAAGC CACAGAAAAA AGATTTTTTT TCAAAAATGT 24 0 

ATCTATATTA ATTCCTGAGA ATTGGAAGGA AAATCCTCAG TACAAAAGGC CAAAACATGA 3 00 

AAACCATAAA CATGCTGATG TTATAGTTGC ACCACCTACA CTCCCAGGTA GAGATGAACC 36 0 

ATACACCAAG CAGTTCACAG AATGTGGAGA GAAAGGCGAA TACATTCACT TCACCCCTGA 420 

CCTTCTACTT GAAAAAAAAC AAAATGAATA TGGACCACCA GGCAAACTGT TTGTCCATGA 480 

GTGGGCTCAC CTCCGGTGGG GAGTGTTTGA TGAGTACAAT GAAGATCAGC CTTTCTACCG 540 

TGCTAAGTCA AAAAAAATCG AAGCAACAAG GTGTTCCGCA GGTATCTCTG GTAGAAATAG 600 

AGTTTATAAG TGTCAAGGAG GCAGCTGTCT TAGTAGAGCA TGCAGAATTG ATTCTACAAC 660 

AAAACTGTAT GGAAAAGATT GTCAATTCTT TCCTGATAAA GTACAAACAG AAAAAGCATC 720 

CATAATGTTT ATGCAAAGTA TTGATTCTGT TGTTGAATTT TGTAACGAAA AAACCCATAA 780 

TCAAGAAGCT CCAAGCCTAC AAAACATAAA GTGCAATTTT AGAAGTACAT GGGAGGTGAT 84 0 

TAGCAATTCT GAGGATTTTA AAAACACCAT ACCCATGGTG ACACCACCTC CTCCACCTGT 900 



p 

d 
nj 
Hi 

m 

NJ 



CTTCTCATTG 
CATGGGGGGT 
GACTGTTGAA 
AAATAAGCTA 
TACATATCCT 
TGGAGAGCTA 
TAACACTGCA 
TG CTTTGGGA 
TCATTTTTAT 
TACATCAGGA 
ACTGAATAGT 
CACGTTCTTT 
TGGAACAATA 
TCCAGGAACT 
AACATTAACT 
GAATGCTAAA 
AATTCTACAA 
GAATGGACAT 
GAATGATGGA 
AAAAGTTCGG 
TAGAGCCGCG 
ACCTGAAATT 
AGGTGCATTT 
TCAAATCACA 
ACCAGGAGAT 
TATTCTTGAT 
ACCAAAGGAG 
AAATG CAAC C 
AGTATCCAAC 
TCCTACTCCT 
TACGCTGGTA 
CATTTGAACC 
CAAAACAATG 
AACTCATAAA 
ATGGATATGT 
TTGTAAGAAA 



CTGAAGATCA 
AAGGACCGCC 
AATGGATCCT 
ATCCAAATAA 
CTGGGAGGAA 
CATTCCCAAC 
AGTTCTTGTA 
AGAGCTGCTG 
GTTTCAGATG 
AATACTGATC 
AATGCCTGGA 
CTCATCACAT 
ATGGAAAATT 
GCAAAGGTGG 
ATTACAGTAA 
ATGAATAAGG 
GGATATGTAC 
ACAGAAGTTT 
GTCTACTCCA 
GCTCATGGAG 
TACATACCAG 
GATGAGGATA 
GTGGTATCAC 
GACCTTGATG 
AATTTTGATG 
CTAAGAGACA 
GCCAACTCCA 
CACATATTTA 
ATTGCACAAG 
ACTCCTACTC 
TTGTCTGTGA 
TTAACGAAGA 
TAAGTAAAGG 
AATAATTTTA 
AAAAACTGTC 
TAGTGATGAA 



GTCAAAGAAT 
TAAATCGAAT 
GGGTGGGGAT 
AAAGCAGTGA 
CTTCCATCTG 
TCGATGGATC 
TTGATGAAGT 
ATGAAGCAGT 
AAGCTCAGAA 
TCTCCCAGAA 
TGAACGACAC 
GGAACAGTCT 
TCACAGTGGA 
GCACTTGGGC 
CTTCTCGAGC 
ACGTAAACAG 
CTGTTCTTGG 
TGGAACTTTT 
GGTATTTTAC 
GAGCAAACAC 
GCTGGGTAGT 
CTCAGACCAC 
AAGTCCCAAG 
CCACAGTTCA 
TTGGAAAAGT 
GTTTTGATGA 
AGGAAAGCTT 
TTGCCATTAA 
TAACTTTGTT 
CTACTCCTGA 
TTGGGTCTGT 
AAAAAATCTT 
ATATTTCTGA 
AGATGTCGGA 
AAGATTAAAA 
CAAAGATCCT 



TGTGTGCTTA 
GAAT C AAGCA 
GGTTCACTTT 
TGAAAGAAAC 
CTCTGGAATT 
CGAAGTACTG 
GAAACAAAGT 
AATAGAGATG 
CAATGGCCTC 
GTCCCTTCAG 
TGTCATAATT 
GCCTCCCAGT 
TGCAACTTCC 
ATACAATCTT 
AGCAAATTCT 
TTTCCCCAGC 
AGCCAATGTG 
GGATAATGGT 
AGCATATACA 
TGCCAGGCTA 
GAACGGGGAA 
CTTGGAGGAT 
CCTTCCCTTG 
TGAGGATAAG 
TCAACGTTAT 
TGCTCTTCAA 
TGCATTTAAA 
AAGTATAGAT 
TATCCCTCAA 
TAAAAGTCAT 
TGTAATTGTT 
CAAGTAGACC 
ATCTTAAAAT 
AAAGGATACT 
TTTAATAGTT 
TTTTCATACT 



GTTCTTGATA 
GCAAAACATT 
GATAGTACTG 
ACACTCATGG 
AAATATGCAT 
CTGCTGACTG 
GGGGCCATTG 
AGCAAGATAA 
ATTGATGCTT 
CTCGAAAGTA 
GATAGTACAG 
ATTTCTCTCT 
AAAATGGCCT 
CAAGCCAAAG 
TCTGTGCCTC 
CCAATGATTG 
ACTGCTTTCA 
GCAGGCGCTG 
GAAAATGGCA 
AAATTACGGC 
ATTGAAGCAA 
TTCAGCCGAA 
CCTGACCAAT 
ATTATTCTTA 
ATCATAAGAA 
GTAAATACTA 
C C AG AAAAT A 
AAAAGCAATT 
GCAAATCCTG 
AATTCTGGAG 
AACTTTATTT 
TAGAAGAGAG 
TCATCCCATG 
TTGATTAAAT 
TCATTTATTT 
GAT 



AGTCTGGAAG 
TCCTGCTGCA 
CCACTATTGT 
CAGGATTACC 
TTCAGGTGAT 
ATGGGGAGGA 
TTCATTTTAT 
CAGGAGGAAG 
TTGGGGCTCT 
AGGGATTAAC 
TGGGAAAGGA 
GGGATCCCAG 
ATCTCAGTAT 
CGAACCCAGA 
CAATCACAGT 
TTTACGCAGA 
TTGAATCACA 
ATTCTTTCAA 
GATATAGCTT 
CTCCACTGAA 
ACCCGCCAAG 
CAGCATCCGG 
ACCCACCAAG 
CATGGACAGC 
TAAGTGCAAG 
CTGATCTGTC 
TCTCAGAAGA 
TGACATCAAA 
ATGACATTGA 
TTAATATTTC 
TAAGTACCAC 
TTTTAAAAAA 
TGTGATCATA 
AAAAACACTC 
GTTATTTTAT 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3043 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GCAAATTCTT 
TTCCCCAGCC 
GCCAATGTGA 
GATAATGGTG 
GCATATACAG 
GCCAGGCTAA 
AACGGGGAAA 
TTGGAGGATT 
CTTCCCTTGC 
GAGGATAAGA 
CAACGTTATA 
GCTCTTCAAG 
GCATTTAAAC 
AGTATAGATA 
ATCCCTCAAG 
AAAAGTCATA 
GTAATTGTTA 



CTGTGCCTCC 
CAATGATTGT 
CTGCTTTCAT 
CAGGCGCTGA 
AAAATGG C AG 
AATTACGGCC 
TTGAAGCAAA 
TCAGCCGAAC 
CTGACCAATA 
TTATTCTTAC 
TCATAAGAAT 
TAAATACTAC 
CAGAAAATAT 
AAAGCAATTT 
CAAATCCTGA 
ATTCTGGAGT 
ACTTTATTTT 



AATCACAGTG 
TTACGCAGAA 
TGAATCACAG 
TTCTTTCAAG 
ATATAGCTTA 
TCCACTGAAT 
CCCGCCAAGA 
AGCATCCGGA 
CCCACCAAGT 
ATGGACAGCA 
AAGTGCAAGT 
TGATCTGTCA 
CTCAGAAGAA 
GACATCAAAA 
TGACATTGAT 
TAATATTTCT 
AAGTACCACC 



AATGCTAAAA 
ATTCTACAAG 
AATGGACATA 
AATGATGGAG 
AAAGTTCGGG 
AGAGCCGCGT 
CCTGAAATTG 
GGTGCATTTG 
CAAATCACAG 
CCAGGAGATA 
ATTCTTGATC 
CCAAAGGAGG 
AATGCAACCC 
GTATCCAACA 
CCTACTCCTA 
ACGCTGGTAT 
ATTTGAACCT 



TGAATAAGGA 
GATATGTACC 
CAGAAGTTTT 
TCTACTCCAG 
CTCATGGAGG 
ACATACCAGG 
ATGAGGATAC 
TGGTATCACA 
ACCTTGATGC 
ATTTTGATGT 
TAAGAGACAG 
CCAACTCCAA 
ACATATTTAT 
TTGCACAAGT 
CTCCTACTCC 
TGTCTGTGAT 
TAACGAAGAA 



CGTAAACAGT 
TGTTCTTGGA 
GGAACTTTTG 
GTATTTTACA 
AGCAAACACT 
CTGGGTAGTG 
TCAGACCACC 
AGTCCCAAGC 
CACAGTTCAT 
TGGAAAAGTT 
TTTTGATGAT 
GGAAAGCTTT 
TGCCATTAAA 
AACTTTGTTT 
TACTCCTGAT 
TGGGTCTGTT 
AAAAATCTTC 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



AAGTAGACCT AGAAGAGAGT TTTAAAAAAC AAAACAATGT AAGTAAAGGA TATTTCTGAA 1080 

TCTTAAAATT CATCCCATGT GTGATCATAA ACTCATAAAA ATAATTTTAA GATGTCGGAA 114 0 

AAGGATACTT TGATTAAATA AAAACACTCA TGGATATGTA AAAACTGTCA AGATTAAAAT 1200 

TTAATAGTTT CATTTATTTG TTATTTTATT TGTAAGAAAT AGTGATGAAC AAAGATCCTT 1260 

TTTCATACTG ATACCTGGTT GTATATTATT TGATGCAACA GTTTTCTGAA ATGATATTTC 1320 

AAATTGC AT C AAGAAATTAA AATCATCTAT CTGAGTAGTC AAAATACAAG TAAAGGAGAG 13 80 
CAAATAAACA ACATTTGGA 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 



60 
120 



GCCAGGAATA ACTAGAGAGG AACAATGGGG TTATTCAGAG GTTTTGTTTT CCTCTTAGTT 
CTGTGCCTGC TGCACCAGTC AAATACTTCC TTCATTAAGC TGAATAATAA TGGCTTTGAA 

StStGTCA TTGTTATAGA TCCTAGTGTG CCAGAAGATG AAAAAATAAT TGAACAAATA 180 

GAGGATATGG TGACTACAGC TTCTACGTAC CTGTTTGAAG CCACAGAAAA AAGATTTTTT 240 

TTCAAAAATG TATCTATATT AATTCCTGAG AATTGGAAGG AAAATCCTCA GTACAAAAGG 300 

CCAAAACATG AAAAC C AT AA ACATGCTGAT GTTATAGTTG CACCACCTAC ACTCCCAGGT 360 

AGAGATGAAC CATACACCAA GCAGTTCACA GAATGTGGAG AGAAAGGCGA ATACATTCAC 420 

TTCACCCCTG ACCTTCTACT TGAAAAAAAA CAAAATGAAT ATGGACCACC AGGCAAACTG 48 0 

TTTGTCCATG AGTGGGCTCA CCTCCGGTGG GGAGTGTTTG ATGAGTACAA TGAAGATCAG 54 0 

CCTTTCTACC GTGCTAAGTC AAAAAAAATC GAAGCAACAA GGTGTTCCGC AGGTATCTCT 600 

GGTAGAAATA GAGTTTATAA GTGTCAAGGA GGCAGCTGTC TTAGTAGAGC ATGCAGAATT 660 

GATTCTACAA C AAAAC TGTA TGGAAAAGAT TGTCAATTCT TTCCTGATAA AGTACAAACA 720 

GAAAAAGCAT CCATAATGTT TATGCAAAGT ATTGATTCTG TTGTTGAATT TTGTAACGAA 780 

AAAACCCATA ATCAAGAAGC TCCAAGCCTA CAAAACATAA AGTGCAATTT TAGAAGTACA 84 0 

t^aSotga TTAGCAATTC TGAGGATTTT AAAAACACCA TACCCATGGT GACACCACCT 900 

CCTCCACCTG TCTTCTCATT GCTGAAGATC AGTCAAAGAA TTGTGTGCTT AGTTCTTGAT 960 

AAGTCTGGAA GCATGGGGGG TAAGGACCGC CTAAATCGAA TGAATCAAGC AGCAAAACAT 1020 

TTCCTGCTGC AGACTGTTGA AAATGGATCC TGGGTGGGGA TGGTTCACTT TGATAGTACT 1080 

GCCACTATTG TAAATAAGCT AATCCAAATA AAAAG C AGTG ATGAAAGAAA CACACTCATG 1140 

GCAGGATTAC CTACATATCC TCTGGGAGGA ACTTCCATCT GCTCTGGAAT TAAATATGCA 1200 

TTTCAGGTGA TTGGAGAGCT ACATTCCCAA CTCGATGGAT CCGAAGTACT GCTGCTGACT 1260 

GATGGGGAGG ATAACACTGC AAGTTCTTGT ATTGATGAAG TGAAACAAAG TGGGGCCATT 1320 

GTTCATTTTA TTGCTTTGGG AAGAGCTGCT GATGAAGCAG TAATAGAGAT GAGCAAGATA 1380 

ACAGGAGGAA GTCATTTTTA TGTTTCAGAT GAAGCTCAGA ACAATGGCCT CATTGATGCT 1440 

TTTGGGGCTC TTACATCAGG AAATACTGAT CTCTCCCAGA AGTCCCTTCA GCTCGAAAGT 1500 

AAGGGATTAA CACTGAATAG TAATGC CTGG ATGAACGACA CTGTCATAAT TGATAGTACA 1560 

GTGGGAAAGG ACACGTTCTT TCTCATCACA TGGAACAGTC TGCCTCCCAG TATTTCTCTC 162 0 

TGGGATCCCA GTGGAACAAT AATGGAAAAT TTCACAGTGG ATGCAACTTC CAAAATGGCC 1680 

TATCTCAGTA TTCCAGGAAC TGCAAAGGTG GGCACTTGGG CATACAATCT TCAAGCCAAA 1740 

GCGAACCCAG AAACATTAAC TATTACAGTA ACTTCTCGAG CAGCAAATTC TTCTGTGCCT 1800 

CCAATCACAG TGAATGCTAA AATGAATAAG GACGTAAACA GTTTCCCCAG CCCAATGATT 1860 

GTTTACGCAG AAATTCTACA AGGATATGTA CCTGTTCTTG GAGCCAATGT GACTGCTTTC 1920 

ATTGAATCAC AGAATGGACA TACAGAAGTT TTGGAACTTT TGGATAATGG TGCAGGCGCT 1980 

GATTCTTTCA AGAATGATGG AGTCTACTCC AGGTATTTTA CAGCATATAC AGAAAATGGC 204 0 

AG AT AT AG C T TAAAAGTTCG GGCTCATGGA GGAGCAAACA CTGCCAGGCT AAAATTACGG 210 0 

CCTCCACTGA ATAGAGCCGC GTACATACCA GGCTGGGTAG TGAACGGGGA AATTGAAGCA 216 0 

AACCCGCCAA GACCTGAAAT TGATGAGGAT ACTCAGACCA C CTTGGAGG A TTTCAGCCGA 222 0 

ACAGCATCCG GAGGTGCATT TGTGGTATCA CAAGTCCCAA GCCTTCCCTT GCCTGACCAA 2 280 

TACCCACCAA GTCAAATCAC AGACCTTGAT GCCACAGTTC ATGAGGATAA GATTATTCTT 2 34 0 

ACATGGACAG CACCAGGAGA TAATTTTGAT GTTGGAAAAG TTCAACGTTA TATCATAAGA 24 00 

ATAAGTGCAA GTATTCTTGA TCTAAGAGAC AGTTTTGATG ATGCTCTTCA AGTAAATACT 2460 

ACTGATCTGT C AC C AAAGG A GGCCAACTCC AAGGAAAGCT TTGCATTTAA AC C AG AAAAT 2520 

ATCTCAGAAG AAAATGCAAC CCACATATTT ATTGCCATTA AAAGTATAGA TAAAAGCAAT 2580 

TTGACATCAA AAGTATCCAA CATTGCACAA GTAACTTTGT TTATCCCTCA AGCAAATCCT 264 0 

GATGACATTG ATCCTACTCC TACTCCTACT CCTACTCCTG ATAAAAGTCA TAATTCTGGA 2700 

GTTAATATTT CTACGCTGGT ATTGTCTGTG ATTGGGTCTG TTGTAATTGT TAACTTTATT 2760 

TTAAGTACCA CCATTTGAAC CTTAACGAAG AAAAAAATCT T C AAGT AG AC CTAGAAGAGA 2 820 



GTTTTAAAAA ACAAAACAAT GTAAGTAAAG GATATTTCTG AATCTTAAAA TTCATCCCAT 2880 

GTGTGATCAT AAACTCATAA AAATAATTTT AAGATGTCGG AAAAGGATAC TTTGATTAAA 294 0 

TAAAAACACT CATGGATATG TAAAAACTGT CAAGATTAAA ATTTAATAGT TTCATTTATT 30 00 

TGTTATTTTA TTTGTAAGAA ATAGTGATGA ACAAAGATCC TTTTTCATAC TGATACCTGG 3 060 

TTGTATATTA TTTGATGCAA CAGTTTTCTG AAATGATATT TCAAATTGCA TCAAGAAATT 3120 

AAAAT CAT C T ATCTGAGTAG TCAAAATACA AGTAAAGGAG AGCAAATAAA CAACATTTGG 3180 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGCTCGGAAT TCCGAGCTTG GATCCTCTAG AGCGGCCGCC GACTAGTGAG CTCGTCGACC 
CGGGAATT 

J (2) INFORMATION FOR SEQ ID NO: 20: 

3 (i) SEQUENCE CHARACTERISTICS: 

fU (A) LENGTH: 68 base pairs 

|_q (B) TYPE: nucleic acid 

r. (C) STRANDEDNESS: single 

jLj (D) TOPOLOGY: linear 

%i 

= (xi) SEQUENCE DESCRIPTION : SEQ ID NO:20: 

K AATTAATTCC CGGGTCGACG AGCTCACTAG TCGGCGGCCG CTCTAGAGGA TCCAAGCTCG 

1 ~ GAATTCCG 

M 

03 (2) INFORMATION FOR SEQ ID NO: 21: 

P (i) SEQUENCE CHARACTERISTICS: 

H (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGCGGATAAC AATTTCACAC AGGA 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TGTAAAACGA CGGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
CTGCCAGGCT AAAATTACGG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATCACAGACC TTGATGCCAC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GCTGGTATTG TCTGTGATTG GGTC 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CATCAGGATT TG CTTGAGGG 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TATTGGTCAG GCAAGGGAAG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTGTTTGCTC CTCCATGAGC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CAAGTAGAAG GTCAGGGGTG 

(2) INFORMATION FOR SEQ ID NO: 30: 

3 (i) SEQUENCE CHARACTERISTICS: 
□ (A) LENGTH: 2 0 base pairs 

flj (B) TYPE: nucleic acid 

= 5 (C) STRANDEDNESS: single 

rV (D) TOPOLOGY: linear 

m 

SJ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

j\ ATAAGTGTCA AGGAGGCAGC 

nJ ( 2 ) INFORMATION FOR SEQ ID NO: 31: 

M 

lg (i) SEQUENCE CHARACTERISTICS: 

i (A) LENGTH: 20 base pairs 

■=■■ (B) TYPE: nucleic acid 

M (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GCAGACTGTT CCATGTGATG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
ATGTACCTGT TCTTGGAGCC 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
ACGTACCTGT TTGAAGCCAC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

GGTAAGGACC GCCTAAATCG 

M (2) INFORMATION FOR SEQ ID NO: 35: 

S (i) SEQUENCE CHARACTERISTICS: 
|jj (A) LENGTH: 20 base pairs 

[y (B) TYPE: nucleic acid 

Ul (C) STRANDEDNESS: single 

U (D) TOPOLOGY: linear 

m 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
M GAAGTGAAAC AAAGTGGGGC 

\Z (2) INFORMATION FOR SEQ ID NO: 36: 

'JO (i) SEQUENCE CHARACTERISTICS: 

O (A) LENGTH: 2 0 base pairs 

iJj (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
TTATCCTCCC CATCAGTCAG 

(2) INFORMATION FOR SEQ ID NO: 37: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
TCGATTTAGG CGGTCCTTAC 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TGTGGCTTCA AACAGGTACG 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GGGTAAGGAC CGCCTAAATC GAATG 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
GAGCCCCAAA AGCATCAATG AGG 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
Met Gly Leu Phe Arg Gly Phe Val Phe Leu Leu Val Leu Cys Leu Leu 
His Gin Ser Asn Thr Ser Phe He Lys Leu Asn Asn Asn Gly Phe Glu 
Asp He Val iL Val He Asp Pro Ser Val Pro Glu Asp Glu Lys He 
He Glu Gin He Glu Asp Met Val Thr Thr Ala Ser Thr Tyr Leu Phe 
Glu ila Thr Glu Lys Arg Phe Phe Phe Lys Asn Val Ser He Leu He 
llo Glu Asn Trp Lys Su Asn Pro Gin Tyr Lys Arg Pro Lys His Glu 
Asn His Lys His Ala Asp Val He Val Ala Pro Pro Thr Leu Pro Gly 
Arg Asp Glu Pro Tyr Thr Lys Gin Phe Thr Glu Cys Gly Glu Lys Gly 
Glu Tyr He His Phe Thr Pro Asp Leu Leu Leu Glu Lys Lys Gin Asn 
Glu Hr Gly Pro Pro Gly IH Leu Phe Val His Glu Trp Ala His Leu 
145 150 
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Arg Trp Gly Val Phe Asp Glu Tyr Asn Glu Asp Gin Pro Phe Tyr Arg 

165 170 175 

Ala Lys Ser Lys Lys He Glu Ala Thr Arg Cys Ser Ala Gly He Ser 

180 185 ' 190 

Gly Arg Asn Arg Val Tyr Lys Cys Gin Gly Gly Ser Cys Leu Ser Arg 

195 200 205 

Ala Cys Arg He Asp Ser Thr Thr Lys Leu Tyr Gly Lys Asp Cys Gin 

210 215 220 

Phe Phe Pro Asp Lys Val Gin Thr Glu Lys Ala Ser He Met Phe Met 
225 230 235 240 

Gin Ser He Asp Ser Val Val Glu Phe Cys Asn Glu Lys Thr His Asn 

245 250 255 

Gin Glu Ala Pro Ser Leu Gin Asn He Lys Cys Asn Phe Arg Ser Thr 

260 265 * 270 

Trp Glu Val He Ser Asn Ser Glu Asp Phe Lys Asn Thr He Pro Met 

275 280 * 285 

Val Thr Pro Pro Pro Pro Pro Val Phe Ser Leu Leu Lys He Ser Gin 

290 295 300 

Arg He Val Cys Leu Val Leu Asp Lys Ser Gly Ser Met Gly Gly Lys 
305 310 315 320 

Asp Arg Leu Asn Arg Met Asn Gin Ala Ala Lys His Phe Leu Leu Gin 

325 330 335 

Thr Val Glu Asn Gly Ser Trp Val Gly Met Val His Phe Asp Ser Thr 

340 345 350 

Ala Thr He Val Asn Lys Leu He Gin He Lys Ser Ser Asp Glu Arg 

355 360 365 

Asn Thr Leu Met Ala Gly Leu Pro Thr Tyr Pro Leu Gly Gly Thr Ser 

370 375 380 

He Cys Ser Gly He Lys Tyr Ala Phe Gin Val He Gly Glu Leu His 
385 390 395 400 

Ser Gin Leu Asp Gly Ser Glu Val Leu Leu Leu Thr Asp Gly Glu Asp 

405 410 415 

Asn Thr Ala Ser Ser Cys He Asp Glu Val Lys Gin Ser Gly Ala He 

420 425 430 

Val His Phe He Ala Leu Gly Arg Ala Ala Asp Glu Ala Val He Glu 

435 440 445 

Met Ser Lys He Thr Gly Gly Ser His Phe Tyr Val Ser Asp Glu Ala 

450 455 460 

Gin Asn Asn Gly Leu He Asp Ala Phe Gly Ala Leu Thr Ser Gly Asn 
465 470 475 480 

Thr Asp Leu Ser Gin Lys Ser Leu Gin Leu Glu Ser Lys Gly Leu Thr 

485 490 495 

Leu Asn Ser Asn Ala Trp Met Asn Asp Thr Val He He Asp Ser Thr 

500 505 510 

Val Gly Lys Asp Thr Phe Phe Leu He Thr Trp Asn Ser Leu Pro Pro 

515 520 525 

Ser He Ser Leu Trp Asp Pro Ser Gly Thr He Met Glu Asn Phe Thr 

530 535 540 

Val Asp Ala Thr Ser Lys Met Ala Tyr Leu Ser He Pro Gly Thr Ala 
545 550 555 560 

Lys Val Gly Thr Trp Ala Tyr Asn Leu Gin Ala Lys Ala Asn Pro Glu 

565 570 575 

Thr Leu Thr He Thr Val Thr Ser Arg Ala Ala Asn Ser Ser Val Pro 

580 585 590 

Pro He Thr Val Asn Ala Lys Met Asn Lys Asp Val Asn Ser Phe Pro 

595 600 605 

Ser Pro Met He Val Tyr Ala Glu lie Leu Gin Gly Tyr Val Pro Val 

610 615 620 

Leu Gly Ala Asn Val Thr Ala Phe He Glu Ser Gin Asn Gly His Thr 
625 ' 630 635 640 

Glu Val Leu Glu Leu Leu Asp Asn Gly Ala Gly Ala Asp Ser Phe Lys 

645 650 * 655 

Asn Asp Gly Val Tyr Ser Arg Tyr Phe Thr Ala Tyr Thr Glu Asn Gly 

660 665 670 

Arg Tyr Ser Leu Lys Val Arg Ala His Gly Gly Ala Asn Thr Ala Arg 
675 680 685 



Leu Lys Leu Arg Pro Pro Leu Asn Arg Ala Ala Tyr He Pro Gly Trp 

690 " 695 700 

Val Val Asn Gly Glu He Glu Ala Asn Pro Pro Arg Pro Glu He Asp 
705 " 710 715 720 

Glu Asp Thr Gin Thr Thr Leu Glu Asp Phe Ser Arg Thr Ala Ser Gly 

725 730 735 

Glv Ala Phe Val Val Ser Gin Val Pro Ser Leu Pro Leu Pro Asp Gin 

740 745 750 

Tvr Pro Pro Ser Gin He Thr Asp Leu Asp Ala Thr Val His Glu Asp 

755 760 765 

Lys He He Leu Thr Trp Thr Ala Pro Gly Asp Asn Phe Asp Val Gly 



770 



775 780 



Lys Val Gin Arg Tyr He He Arg He Ser Ala Ser He Leu Asp Leu 
785 790 795 800 

Arq Asp Ser Phe Asp Asp Ala Leu Gin Val Asn Thr Thr Asp Leu Ser 

805 810 815 

Pro Lys Glu Ala Asn Ser Lys Glu Ser Phe Ala Phe Lys Pro Glu Asn 

820 825 830 

He Ser Glu Glu Asn Ala Thr His He Phe He Ala lie Lys Ser He 

835 840 845 

Asp Lys Ser Asn Leu Thr Ser Lys Val Ser Asn He Ala Gin Val Thr 
La 850 855 860 

O Leu Phe He Pro Gin Ala Asn Pro Asp Asp He Asp Pro Thr Pro Thr 

865 870 875 880 

=j pro Thr Pro Thr p r o Asp Lys Ser His Asn Ser Gly Val Asn lie Ser 

IU 885 890 895 

Ul Thr Leu Val Leu Ser Val He Gly Ser Val Val He Val Asn Phe He 

L_§ 900 905 910 

b Leu Ser Thr Thr He 

NJ 915 

s (2) INFORMATION FOR SEQ ID NO: 42: 

M 

nj (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

yQ (C) STRANDEDNESS : single 

□ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Ala Asn Ser Ser Val Pro Pro He Thr Val Asn Ala Lys Met Asn Lys 

1 5 10 15 

Asp Val Asn Ser Phe 

20 

(2) INFORMATION FOR SEQ ID NO -.43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Asp Asn Gly Ala Gly Ala Asp Ser Phe Lys Asn Asp Gly Val Tyr Ser 

1 5 1° Xb 

Arg Tyr Phe Thr Ala Tyr Thr Glu Asn Gly Arg Tyr Ser Leu Lys 



(2) INFORMATION FOR SEQ ID NO : 44 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Val Arg Ala His Gly Gly Ala Asn Thr Ala Arg Leu Lys Leu Arg Pro 

15 10 15 

Pro Leu Asn Arg Ala Ala Tyr lie 
20 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Ser Leu Pro Leu Pro Asp Gin Tyr Pro Pro Ser Gin lie Thr Asp Leu 

15 10 15 

Asp Ala Thr Val His Glu Asp Lys He He Leu Thr Trp Thr Ala Pro 

20 25 30 

Gly Asp Asn Phe Asp Val Gly Lys 
35 40 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Tyr Asn Glu Asp Gin Pro Phe Tyr Arg Ala Lys Ser Lys Lys He Glu 

15 10 15 

Ala Thr Arg Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



Leu Ser Arg Ala Cys Arg He Asp Ser Thr Thr Lys Leu Tyr Gly Lys 

1 5 10 15 

Asp Cys Gin Phe Phe Pro Asp Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-48: 

Lvs Ser Ser Asp Glu Arg Asn Thr Leu Met Ala Gly Leu Pro Thr Tyr 

1 5 10 15 

Pro Leu Gly Gly 

M 20 

O 

a (2) INFORMATION FOR SEQ ID NO: 49: 

i M (i) SEQUENCE CHARACTERISTICS: 

01 (A) LENGTH: 15 amino acids 
M (B) TYPE: amino acid 

m (C) STRANDEDNESS : single 

Cj (D) TOPOLOGY: linear 

s (ii) MOLECULE TYPE: None 

Kj (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

H Glu Ile Agp Glu Asp Thr Qln Thr Thr Leu Glu Asp Phe Ser Arg 

tf] 1 5 10 I 5 

2 (2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
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Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn Met His Thr Glu His 

1 5 10 " 15 

His His His His His 
20 



D 

a 
m 
yi 



03 



ft 



